NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Tyche: Making Sense of Property-Based Testing Effectiveness

https://doi.org/10.1145/3654777.3676407

Goldstein, Harrison; Tao, Jeffrey; Hatfield-Dodds, Zac; Pierce, Benjamin C; Head, Andrew (October 2024, ACM)

Software developers increasingly rely on automated methods to assess the correctness of their code. One such method is property-based testing (PBT), wherein a test harness generates hundreds or thousands of inputs and checks the outputs of the program on those inputs using parametric properties. Though powerful, PBT induces a sizable gulf of evaluation: developers need to put in nontrivial effort to understand how well the different test inputs exercise the software under test. To bridge this gulf, we propose Tyche, a user interface that supports sensemaking around the effectiveness of property-based tests. Guided by a formative design exploration, our design of Tyche supports developers with interactive, configurable views of test behavior with tight integrations into modern developer testing workflow. These views help developers explore global testing behavior and individual test inputs alike. To accelerate the development of powerful, interactive PBT tools, we define a standard for PBT test reporting and integrate it with a widely used PBT library. A self-guided online usability study revealed that Tyche’s visualizations help developers to more accurately assess software testing effectiveness.
more » « less
Full Text Available
Etna: An Evaluation Platform for Property-Based Testing (Experience Report)

https://doi.org/10.1145/3607860

Shi, Jessica; Keles, Alperen; Goldstein, Harrison; Pierce, Benjamin C.; Lampropoulos, Leonidas (August 2023, Proceedings of the ACM on Programming Languages)

Property-based testing is a mainstay of functional programming, boasting a rich literature, an enthusiastic user community, and an abundance of tools — so many, indeed, that new users may have difficulty choosing. Moreover, any given framework may support a variety of strategies for generating test inputs; even experienced users may wonder which are better in a given situation. Sadly, the PBT literature, though long on creativity, is short on rigorous comparisons to help answer such questions. We present Etna, a platform for empirical evaluation and comparison of PBT techniques. Etna incorporates a number of popular PBT frameworks and testing workloads from the literature, and its extensible architecture makes adding new ones easy, while handling the technical drudgery of performance measurement. To illustrate its benefits, we use Etna to carry out several experiments with popular PBT approaches in both Coq and Haskell, allowing users to more clearly understand best practices and tradeoffs.
more » « less
Full Text Available
Formalizing Stack Safety as a Security Property

https://doi.org/10.1109/CSF57540.2023.00037

Anderson, Sean Noble; Blanco, Roberto; Lampropoulos, Leonidas; Pierce, Benjamin C.; Tolmach, Andrew (July 2023, 2023 IEEE 36th Computer Security Foundations Symposium (CSF))

Full Text Available
Parsing Randomness

https://doi.org/10.1145/3563291

Goldstein, Harrison; Pierce, Benjamin C. (January 2022, Proceedings of the ACM on programming languages)

Random data generators can be thought of as parsers of streams of randomness. This perspective on generators for random data structures is established folklore in the programming languages community, but it has never been formalized, nor have its consequences been deeply explored. We build on the idea of freer monads to develop free generators, which unify parsing and generation using a common structure that makes the relationship between the two concepts precise. Free generators lead naturally to a proof that a monadic generator can be factored into a parser plus a distribution over choice sequences. Free generators also support a notion of derivative, analogous to the familiar Brzozowski derivatives of formal languages, allowing analysis tools to “preview” the effect of a particular generator choice. This gives rise to a novel algorithm for generating data structures satisfying user-specified preconditions.
more » « less
Full Text Available
C4: verified transactional objects

https://doi.org/10.1145/3527324

Lesani, Mohsen; Xia, Li-yao; Kaseorg, Anders; Bell, Christian J.; Chlipala, Adam; Pierce, Benjamin C.; Zdancewic, Steve (April 2022, Proceedings of the ACM on Programming Languages)

Transactional objects combine the performance of classical concurrent objects with the high-level programmability of transactional memory. However, verifying the correctness of transactional objects is tricky, requiring reasoning simultaneously about classical concurrent objects, which guarantee the atomicity of individual methods—the property known as linearizability—and about software-transactional-memory libraries, which guarantee the atomicity of user-defined sequences of method calls—or serializability. We present a formal-verification framework called C4, built up from the familiar notion of linearizability and its compositional properties, that allows proof of both kinds of libraries, along with composition of theorems from both styles to prove correctness of applications or further libraries. We apply the framework in a significant case study, verifying a transactional set object built out of both classical and transactional components following the technique of transactional predication ; the proof is modular, reasoning separately about the transactional and nontransactional parts of the implementation. Central to our approach is the use of syntactic transformers on interaction trees —i.e., transactional libraries that transform client code to enforce particular synchronization disciplines. Our framework and case studies are mechanized in Coq.
more » « less
Full Text Available
Model-based testing of networked applications

https://doi.org/10.1145/3460319.3464798

Li, Yishuai; Pierce, Benjamin C; Zdancewic, Steve (January 2021, ISSTA ’21: 30th ACM SIGSOFT International Symposium on Software Testing and Analysis)

We present a principled automatic testing framework for application-layer protocols. The key innovation is a domain-specific embedded language for writing nondeterministic models of the behavior of networked servers. These models are defined within the Coq interactive theorem prover, supporting a smooth transition from testing to formal verification. Given a server model, we show how to automatically derive a tester that probes the server for unexpected behaviors. We address the uncertainties caused by both the server's internal choices and the network delaying messages nondeterministically. The derived tester accepts server implementations whose possible behaviors are a subset of those allowed by the nondeterministic model. We demonstrate the effectiveness of this framework by using it to specify and test a fragment of the HTTP protocol, showing that the automatically derived tester can capture RFC violations in buggy server implementations, including the latest versions of Apache and Nginx.
more » « less
Full Text Available
From C to interaction trees: specifying, verifying, and testing a networked server

https://doi.org/10.1145/3293880.3294106

Koh, Nicolas; Li, Yao; Li, Yishuai; Xia, Li-yao; Beringer, Lennart; Honoré, Wolf; Mansky, William; Pierce, Benjamin C.; Zdancewic, Steve (January 2019, Proceedings of the 8th ACM SIGPLAN International Conference on Certified Programs and Proofs)

Full Text Available

Search for: All records